Weka-GDPM – Integrating Classical Data Mining Toolkit to Geographic Information Systems

Authors

  • Vania Bogorny
  • Andrey Tietbohl Palma
  • Paulo Martins Engel
  • Luis Otavio Alvares
Abstract

Geographic data preprocessing is the most effort- and time-consuming step in spatial data mining. To facilitate geographic data preprocessing and encourage wider use of spatial data mining, this paper presents Weka-GDPM, an interoperable module that supports automatic geographic data preprocessing for spatial data mining. GDPM is implemented in Weka, a free and open source classical data mining toolkit that is widely used in academic institutions. GDPM follows the Open GIS specifications to support interoperability with Geographic Information Systems. It automatically generates data at two granularity levels without using prior knowledge and supports both distance and topological spatial relationships.
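To make the idea of preprocessing concrete, here is a minimal Python sketch of turning spatial relationships into predicates that a classical mining tool could consume. It is not the Weka-GDPM API. The abstract does not say what the two granularity levels are, so the sketch assumes they are feature type (e.g. `school`) and feature instance (e.g. `school_1`). The names `Region`, `PointFeature`, `spatial_predicates`, and `near_threshold` are hypothetical, and the geometry is simplified to points and axis-aligned rectangles, with `contains` standing in for topological relationships and a `near` threshold for distance relationships.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Region:
    """Axis-aligned rectangle standing in for a target feature (assumption)."""
    name: str
    xmin: float
    ymin: float
    xmax: float
    ymax: float

    def contains(self, x, y):
        # Simplified topological test: is the point inside or on the boundary?
        return self.xmin <= x <= self.xmax and self.ymin <= y <= self.ymax

    def distance_to(self, x, y):
        # Euclidean distance from the point to the rectangle (0.0 if inside).
        dx = max(self.xmin - x, 0.0, x - self.xmax)
        dy = max(self.ymin - y, 0.0, y - self.ymax)
        return hypot(dx, dy)


@dataclass(frozen=True)
class PointFeature:
    """A relevant feature with a type, an instance id, and a location."""
    ftype: str
    fid: int
    x: float
    y: float


def spatial_predicates(region, features, near_threshold, instance_level=False):
    """Return sorted predicates relating `region` to each feature.

    instance_level=False labels predicates by feature type only;
    instance_level=True appends the instance id (assumed finer granularity).
    """
    preds = set()
    for f in features:
        label = f"{f.ftype}_{f.fid}" if instance_level else f.ftype
        if region.contains(f.x, f.y):
            preds.add(f"contains_{label}")
        elif region.distance_to(f.x, f.y) <= near_threshold:
            preds.add(f"near_{label}")
    return sorted(preds)


# Made-up illustrative data, not taken from the paper.
district = Region("district_A", 0, 0, 10, 10)
features = [
    PointFeature("school", 1, 2, 3),
    PointFeature("school", 2, 8, 8),
    PointFeature("hospital", 7, 12, 5),
    PointFeature("hospital", 9, 40, 40),
]

type_level = spatial_predicates(district, features, near_threshold=5.0)
inst_level = spatial_predicates(district, features, near_threshold=5.0,
                                instance_level=True)
print(type_level)
print(inst_level)
```

At the coarser level the two schools collapse into a single `contains_school` predicate, while the finer level keeps them apart. That trade-off between fewer, more general predicates and more, more specific ones is what a choice of granularity controls under this assumption.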

Similar resources

A Reuse-based Spatial Data Preparation Framework for Data Mining

The constant increase in the use of geographic data in different application domains has resulted in large amounts of data stored in spatial databases and in a demand for data mining. Many solutions for spatial data mining have been proposed. Most create data mining languages or extend existing query languages to support data mining operations. This paper presents an interoperable framework for sp...

Weka-STPM: from trajectory samples to semantic trajectories

Enormous quantities of trajectory data are collected from many sources, such as GPS devices and mobile phones, as sequences of points. These data can be used in many application domains such as traffic management, urban planning, tourism, and bird migration. However, in most applications a higher level of abstraction should be used instead of sample points. In this paper we present an extension of th...

GeoSTAT - A System for Visualization, Analysis and Clustering of Distributed Spatiotemporal Data

Nowadays, there is a considerable amount of spatiotemporal data available on the web. The visualization of these data requires several visual resources that help users interpret the data set correctly. Furthermore, the use of data mining algorithms has proven relevant in helping the exploratory analysis of spatiotemporal data. This paper proposes the GeoSTAT (GEOgraphic Spatio...

Weka4WS: A WSRF-Enabled Weka Toolkit for Distributed Data Mining on Grids

This paper presents Weka4WS, a framework that extends the Weka toolkit for supporting distributed data mining on Grid environments. Weka4WS adopts the emerging Web Services Resource Framework (WSRF) for accessing remote data mining algorithms and managing distributed computations. The Weka4WS user interface is a modified Weka Explorer environment that supports the execution of both local and re...

WSRF Services for Composing Distributed Data Mining Applications on Grids: Functionality and Performance

The Web Services Resource Framework (WSRF) has recently emerged as the standard for the implementation of Grid applications. WSRF can be exploited for developing high-level services for distributed data mining applications. This paper describes Weka4WS, a framework that extends the widely-used Weka toolkit for supporting distributed data mining on WSRF-enabled Grids. Weka4WS adopts the WSRF tec...


Publication date: 2006